Self-Supervised Modality-Aware Multiple Granularity Pre-Training for RGB-Infrared Person Re-Identification

نویسندگان

چکیده

RGB-Infrared person re-identification (RGB-IR ReID) aims to associate people across disjoint RGB and IR camera views. Currently, state-of-the-art performance of RGB-IR ReID is not as impressive that conventional ReID. Much due the notorious modality bias training issue brought by single-modality ImageNet pre-training, which might yield RGB-biased representations severely hinder cross-modality image retrieval. This paper makes first attempt tackle task from a pre-training perspective. We propose self-supervised solution, named Modality-Aware Multiple Granularity Learning (MMGL), directly trains models scratch only on multi-modal datasets, but achieving competitive results against without using any external data or sophisticated tuning tricks. First, we develop simple-but-effective ‘permutation recovery’ pretext globally maps shuffled images into shared latent permutation space, providing modality-invariant global for downstream tasks. Second, present part-aware cycle-contrastive (PCC) learning strategy utilizes cycle-consistency maximize agreement between semantically similar patches. enables contrastive unpaired scenarios, further improving discriminability local features laborious instance augmentation. Based these designs, MMGL effectively alleviates problem. Extensive experiments demonstrate it learns better (+8.03% Rank-1 accuracy) with faster speed (converge in few hours) higher efficiency (<5% size) than pre-training. The also suggest generalizes well various existing models, losses has promising transferability datasets. code will be released at https://github.com/hansonchen1996/MMGL.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Supplementary Material for “RGB-Infrared Cross-Modality Person Re-Identification”

This supplementary material accompanies the paper “RGB-Infrared Cross-Modality Person Re-Identification”. It includes more details of Section 4, as well as extra evaluations of our proposed deep zero-padding method. 1. Details of Counting Domain-Specific Nodes In the third paragraph of Section 4.2 in the main manuscript, we quantify the number of domain-specific nodes in the trained network in ...

متن کامل

Resource Aware Person Re-identification across Multiple Resolutions

Not all people are equally easy to identify: color statistics might be enough for some cases while others might require careful reasoning about highand low-level details. However, prevailing person re-identification(re-ID) methods use one-size-fits-all high-level embeddings from deep convolutional networks for all cases. This might limit their accuracy on difficult examples or makes them needle...

متن کامل

for “ Scalable Person Re - identification on Supervised Smoothed Manifold ”

The document contains the supplementary materials for “Scalable Person Re-identification on Supervised Smoothed Manifold”. Due to the space limitation, the comparison with state-of-the-art on PRID450S dataset [4] has to be simplified in the main paper. The only goal of this document is to present this comparison in Table 1. As can be seen, SSM sets a new state-of-the-art performance on this dat...

متن کامل

Physics-aware Self-supervised Training of CNNs for Object Detection

Impressive progress has been achieved recently in object detection with the use of deep learning. Nevertheless, such tools typically require large amounts of training data and significant manual effort for labeling objects. This limits their applicability in realistic domains, where it is necessary to scale solutions to a large number of objects and a variety of conditions. This work proposes a...

متن کامل

Deep self-paced learning for person re-identification

Person re-identification (Re-ID) usually suffers from noisy samples with background clutter and mutual occlusion, which makes it extremely difficult to distinguish different individuals across the disjoint camera views. In this paper, we propose a novel deep selfpaced learning (DSPL) algorithm to alleviate this problem, in which we apply a self-paced constraint and symmetric regularization to h...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Information Forensics and Security

سال: 2023

ISSN: ['1556-6013', '1556-6021']

DOI: https://doi.org/10.1109/tifs.2023.3273911